Picture for Bumsub Ham

Bumsub Ham

Improving Visual Token Reduction via Rectifying Distortions for Efficient Multimodal LLM Inference

Add code
Jun 01, 2026
Viaarxiv icon

Exploring Hierarchical Consistency and Unbiased Objectness for Open-Vocabulary Object Detection

Add code
Apr 25, 2026
Viaarxiv icon

Relational Feature Caching for Accelerating Diffusion Transformers

Add code
Feb 23, 2026
Viaarxiv icon

GrowTAS: Progressive Expansion from Small to Large Subnets for Efficient ViT Architecture Search

Add code
Dec 13, 2025
Viaarxiv icon

AccuQuant: Simulating Multiple Denoising Steps for Quantizing Diffusion Models

Add code
Oct 23, 2025
Viaarxiv icon

Jailbreaking on Text-to-Video Models via Scene Splitting Strategy

Add code
Sep 26, 2025
Viaarxiv icon

Subnet-Aware Dynamic Supernet Training for Neural Architecture Search

Add code
Mar 13, 2025
Viaarxiv icon

ELITE: Enhanced Language-Image Toxicity Evaluation for Safety

Add code
Feb 10, 2025
Figure 1 for ELITE: Enhanced Language-Image Toxicity Evaluation for Safety
Figure 2 for ELITE: Enhanced Language-Image Toxicity Evaluation for Safety
Figure 3 for ELITE: Enhanced Language-Image Toxicity Evaluation for Safety
Figure 4 for ELITE: Enhanced Language-Image Toxicity Evaluation for Safety
Viaarxiv icon

Maximizing the Position Embedding for Vision Transformers with Global Average Pooling

Add code
Feb 05, 2025
Viaarxiv icon

Efficient Few-Shot Neural Architecture Search by Counting the Number of Nonlinear Functions

Add code
Dec 19, 2024
Viaarxiv icon